Search CORE

495 research outputs found

Données confidentielles : génération de jeux de données synthétisés par forêts aléatoires pour des variables catégoriques [Confidential data: generating synthetic datasets with random forests for categorical variables]

Author: Caron Maxime
Publication date: 23/04/2018

Data confidentiality has become essential in statistics. A method often used to reduce the risk of re-identification is to generate partially synthetic datasets for data sharing. We explain the concept of synthetic datasets and describe an algorithm based on random forests for synthesizing categorical variables. We examine the formula used to make inferences from multiple synthetic datasets, and show that the order in which the variables are synthesized affects the variance estimates it produces. We propose a variant of the algorithm inspired by differential privacy, and show that in this case neither a regression coefficient nor its variance can be estimated adequately. We also show the impact of synthetic datasets on structural equation modeling, and conclude that synthetic datasets leave the coefficients between latent variables and measured variables practically unchanged.
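The abstract refers to a formula for inference from multiple synthetic datasets without stating it. As an illustration only, the sketch below implements a standard combining rule for partially synthetic data: the point estimate is the mean of the per-dataset estimates, and its variance is the mean within-dataset variance plus the between-dataset variance divided by the number of datasets. That the thesis studies exactly this rule is an assumption, and the function name `combine_partially_synthetic` is hypothetical.

```python
from statistics import mean, variance


def combine_partially_synthetic(estimates, variances):
    """Combine results from m partially synthetic datasets (illustrative sketch).

    estimates: point estimates q_1..q_m, one per synthetic dataset.
    variances: the corresponding variance estimates u_1..u_m.
    Returns (q_bar, total_variance), assuming the rule T = u_bar + b/m.
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two datasets and matching lengths")
    q_bar = mean(estimates)    # combined point estimate
    b = variance(estimates)    # between-dataset variance (divides by m - 1)
    u_bar = mean(variances)    # average within-dataset variance
    return q_bar, u_bar + b / m


# Three synthetic datasets, each giving an estimate and its variance.
q_bar, total_var = combine_partially_synthetic([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
```

Because the between-dataset term depends on how much the estimates vary from one synthetic copy to the next, anything that changes that spread, such as the order in which variables are synthesized, would change the resulting variance estimate.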

CorpusUL

Conception de microARNs pour atténuer l'expression de gènes [Design of microRNAs to attenuate gene expression]

Author: Caron Maxime
Publication date: 01/09/2008

MicroRNAs belong to the family of small non-coding RNAs and act as downregulators of messenger RNAs and/or their protein products. They differ from small interfering RNAs (siRNAs) in that they attenuate expression rather than shutting it down. In recent years, numerous microRNAs and their targets have been found in mammals and plants. Bioinformatics plays an important role in this field, and target-discovery software has been made available to the scientific community. Each microRNA can regulate hundreds of genes, and microRNA expression profiles have been shown to classify human cancers. The design of artificial microRNAs is therefore justified, as one could target overexpressed oncogenes and promote the proliferation of healthy cells. MultiTar V1.0, a tool for creating artificial microRNAs, has been implemented and is available as a web application. The tool relies on structural and biological properties of microRNAs and uses tabu search, a metaheuristic. A typical biological problem is presented, and it is shown that a microRNA designed in silico has in vitro effects. The 3’UTR sequences of E2F1, E2F2 and E2F3 were given as input to the tool, and the predicted microRNAs were then tested using luciferase assays, western blots and cell growth curves. At least one artificial microRNA was able to regulate all three genes in luciferase assays, and all of the designed microRNAs regulated the expression of E2F1 and E2F2 in western blots. Growth curves, studied to investigate overall biological effects, showed reduced growth for all solutions. These results open a new door to therapeutic possibilities.

Dépôt Institutionnel Numérique

Conception et mise au point d'un module de connexion réseau modulaire, bidirectionnel en courant et isolé [Design and development of a modular, current-bidirectional, isolated grid-connection module]

Author: Caron Maxime
Publication venue: École de technologie supérieure

The need to reduce the development time of power converters and to increase the efficiency of test equipment keeps growing. The continual development of technologies that use electricity, such as the electrification of the automobile, is accelerating this trend. Moreover, as energy prices keep rising, interest in regenerative test equipment for validating power converters is growing. The work in this thesis follows on from the development of regenerative load emulators. This type of smart load requires a connection interface to the grid. For some applications, this interface must be bidirectional and provide galvanic isolation. For example, power amplifiers used to emulate the behaviour of an AC voltage source, such as a distribution grid or a motor, can operate in all four current-voltage quadrants. It is therefore important to provide this type of converter with a bidirectional grid connection. Furthermore, with the development of multilevel converters, isolation becomes especially valuable, since it allows several voltage levels to be connected in any arrangement. This work therefore addresses the development of a grid-connection unit performing AC-DC conversion. It covers the design and optimization of a DC-DC converter and an AC-DC converter, together with the modelling, simulation, design and experimental testing of a 5 kW prototype. The stability of the interconnection between the two converters is also analysed and tested in practice.

Espace ÉTS

An adaptive multi-agent system for task reallocation in a MapReduce job

Author: Baert Quentin
Caron Anne-Cécile
Morge Maxime
Routier Jean-Christophe
Stathis Kostas
Publication venue: Elsevier BV
Publication date: 01/04/2021

We study the problem of task reallocation for load-balancing of MapReduce jobs in applications that process large datasets. In this context, we propose a novel strategy based on cooperative agents used to optimise the task scheduling in a single MapReduce job. The novelty of our strategy lies in the ability of agents to identify opportunities within a current unbalanced allocation, which in turn trigger concurrent and one-to-many negotiations amongst agents to locally reallocate some of the tasks within a job. Our contribution is that tasks are reallocated according to the proximity of the resources and are performed in accordance with the capabilities of the nodes in which agents are situated. To evaluate the adaptivity and responsiveness of our approach, we implement a prototype test-bed and conduct a large set of experiments in a heterogeneous environment, exploring varying hardware configurations. This extensive experimentation reveals that our strategy significantly improves the overall runtime over the classical Hadoop data processing.

Royal Holloway - Pure

HAL Descartes

Hal-Diderot

Fluorescent labeling in semi-solid medium for selection of mammalian cells secreting high-levels of recombinant proteins

Author: Ba Ismaïla
Caron Antoine W
Gaillet Bruno
Garnier Alain
Gilbert Rénald
Massie Bernard
Nicolas Claire
Pinard Maxime
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Despite the powerful impact in recent years of gene expression markers like the green fluorescent protein (GFP) to link the expression of recombinant protein for selection of high producers, there is a strong incentive to develop rapid and efficient methods for isolating mammalian cell clones secreting high levels of marker-free recombinant proteins. Recently, a method combining cell colony growth in methylcellulose-based medium with detection by a fluorescently labeled secondary antibody or antigen has shown promise for the selection of Chinese Hamster Ovary (CHO) cell lines secreting recombinant antibodies. Here we report an extension of this method, referred to as fluorescent labeling in semi-solid medium (FLSSM), to detect recombinant proteins significantly smaller than antibodies, such as IGF-E5, a 25 kDa insulin-like growth factor derivative. Results: CHO cell clones, expressing 300 μg/ml IGF-E5 in batch culture, were isolated more easily and quickly compared to the classic limiting dilution method. The intensity of the detected fluorescent signal was found to be proportional to the amount of IGF-E5 secreted, thus allowing the highest producers in the population to be identified and picked. CHO clones producing up to 9.5 μg/ml of Tissue-Plasminogen Activator (tPA, 67 kDa) were also generated using FLSSM. In addition, IGF-E5 high-producers were isolated from 293SF transfectants, showing that cell selection in semi-solid medium is not limited to CHO and lymphoid cells. The best positive clones were collected with a micromanipulator as well as with an automated colony picker, thus demonstrating the method's high-throughput potential. Conclusion: FLSSM allows rapid visualization of the high secretors from transfected pools prior to picking, thus eliminating the tedious task of screening a high number of cell isolates. Because of its rapidity and its simplicity, FLSSM is a versatile method for the screening of high producers for research and industry.

NRC Publications Archive

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Allocation équitable de tâches pour l'analyse de données massives [Fair task allocation for the analysis of massive data]

Author: Baert Quentin
Caron Anne-Cécile
Morge Maxime
Routier Jean-Christophe
Publication venue: Cépaduès éditions
Publication date: 05/10/2016

The URL of the book is: http://www.cepadues.com/livres/jfsma-2016-systemes-multi-agents-simulations-9782364935594.html

Many companies use MapReduce applications to process very large amounts of data. Static optimization of such applications is complex because they are based on user-defined operations, called map and reduce, which prevent some algebraic optimizations. In order to optimize the task allocation, several systems collect data from previous runs and predict performance through job profiling. However, they are not effective during the learning phase, or when a new type of job or dataset appears. In this paper, we present an adaptive multi-agent system for the analysis of large datasets with MapReduce. We do not preprocess data and we adopt a dynamic approach, where the reducer agents interact during the job. In order to decrease the workload of the most loaded reducer, and so the execution time, we propose a task reallocation based on negotiation.
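To make the idea of lowering the load of the most loaded reducer more concrete, here is a minimal, centralized sketch. It is not the authors' protocol: the paper describes reducer agents negotiating during the job, whereas this illustration simply loops over one donor and one receiver at a time. The function name `rebalance`, the dictionary layout and the acceptance rule (a task moves only if the receiver ends up strictly below the donor's current load) are all assumptions, and task costs are assumed to be positive numbers.

```python
def rebalance(loads):
    """Greedy stand-in for negotiation-based task reallocation (illustrative).

    loads: dict mapping a reducer name to a list of positive task costs.
    Returns a new dict in which no single transfer from the most loaded
    reducer to the least loaded one can still lower the donor's load
    without pushing the receiver to that level or above.
    """
    loads = {agent: list(tasks) for agent, tasks in loads.items()}
    while True:
        donor = max(loads, key=lambda a: sum(loads[a]))
        receiver = min(loads, key=lambda a: sum(loads[a]))
        donor_load = sum(loads[donor])
        receiver_load = sum(loads[receiver])
        # A proposal is accepted only if the receiver stays strictly
        # below the donor's current load after taking the task.
        acceptable = [c for c in loads[donor] if receiver_load + c < donor_load]
        if not acceptable:
            return loads
        task = max(acceptable)  # hand over the largest acceptable task
        loads[donor].remove(task)
        loads[receiver].append(task)


before = {"r1": [5, 4, 3], "r2": [1], "r3": []}
after = rebalance(before)
heaviest_before = max(sum(tasks) for tasks in before.values())
heaviest_after = max(sum(tasks) for tasks in after.values())
```

Each accepted transfer strictly reduces the sum of squared loads, so the loop terminates. A distributed version would replace the global `max` and `min` with local proposals and replies between agents.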

HAL Descartes

Hal-Diderot

Délégation de lots de tâches pour la réduction de la durée moyenne de réalisation [Delegating batches of tasks to reduce the mean completion time]

Author: Anne-Cécile Caron
Ellie Beauprez
Jean-Christophe Routier
Maxime Morge
Publication venue: Cellule MathDoc/CEDRAM
Publication date: 01/01/2023

Revue Ouverte d’Intelligence Artificielle

Allelic expression mapping across cellular lineages to establish impact of non-coding SNPs

Author: Alicia Schiavi
Alison H Goodall
Ann‐Christine Syvänen
Bing Ge
Chuan Wang
Francois Cambien
Jonas Carlsson Almlöf
Lars Rönnblom
Maxime Caron
Nicholas Light
Panos Deloukas
Per Lundmark
Shu‐Huang Chen
Tomi Pastinen
Tony Kwan
Veronique Adoue
Wang Y
Willem H Ouwehand
Publication venue: EMBO
Publication date: 01/01/2014

This is an open access article under the terms of the Creative Commons Attribution 4.0 License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited

Crossref

Publikationer från Uppsala Universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Queen Mary Research Online

Leicester Research Archive

A Location-Aware Strategy for Agents Negotiating Load-balancing

Author: Baert Quentin
Caron Anne-Cécile
Morge Maxime
Routier Jean-Christophe
Stathis Kostas
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 04/11/2019

We study a novel location-aware strategy for distributed systems where cooperating agents perform the load-balancing. The strategy allows agents to identify opportunities within a current unbalanced allocation, which in turn triggers concurrent and one-to-many negotiations amongst agents to locally reallocate some tasks. The tasks are reallocated according to the proximity of the resources and are performed in accordance with the capabilities of the nodes in which agents are situated. This dynamic and ongoing negotiation process takes place concurrently with the task execution, so the task allocation process adapts to disruptions (task consumption, nodes slowing down). We evaluate the strategy in a multi-agent deployment of the MapReduce design pattern for processing large datasets. Empirical results demonstrate that our strategy significantly improves the overall runtime of the data processing.

Royal Holloway - Pure

HAL Descartes

Hal-Diderot

Stratégie situationnelle pour l'équilibrage de charge [Location-aware strategy for load balancing]

Author: Baert Quentin
Caron Anne-Cécile
Morge Maxime
Routier Jean-Christophe
Stathis Kostas
Publication venue: Cépaduès
Publication date: 03/07/2019

We study a novel location-aware strategy for distributed systems where cooperating agents perform the load-balancing. The strategy allows agents to identify opportunities within a current unbalanced allocation, which in turn triggers concurrent and one-to-many negotiations amongst agents to locally reallocate some tasks. The tasks are reallocated according to the proximity of the resources and are performed in accordance with the capabilities of the nodes in which agents are situated. This dynamic and ongoing negotiation process takes place concurrently with the task execution, so the task allocation process adapts to disruptions (task consumption, nodes slowing down). We evaluate the strategy in a multi-agent deployment of the MapReduce design pattern, which enables the distributed processing of large datasets. Empirical results demonstrate that our strategy significantly improves the overall runtime of the data processing.

Hal-Diderot